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Computer Processing Errors in the 1961 Census Wage-Earnings Data 


Introduction 


This paper deals with two errors in the computer processing of 
the 1961 wages and salaries data which have been proven to have resulted in 
serious distortions of the data. It is hoped that this information will 
enable interested users to make their own adjustments to 1961 data. 


The information contained in this paper has been gathered by 
the author in the course of two year's work as Head of the Incomes and 
Earnings Unit of the Economic Characteristics Section, Census Division. 


A brief note on the structure of the 1961 Census income data 
is in order. A question on income from wages and salaries for the 12 
months preceding the Census was asked of all wage-earners through face- 
to-face enumeration. In addition, a 20% sample of non-farm, non-insti- 
tutional households was left a self-enumeration form in which all persons 
over the age of 15 were asked questions on all sources of income except 
net farming income. In fact, therefore, the question on wages and sal- 
aries was asked twice of 20% of the wage-earners in the 1961 Census. 


: _Two fairly serious errors were made in the computer processing 
of wages and salaries data. One error involved the wages and salaries 
reported on the sample document, the other as gathered on a 100% basis. 


The root causes of the errors lie with the fact that 1961 was 
Canada's first computerized Census, and much had to be learned of the 
difficult art of communitations between subject-matter specialists and 
computer programmers. 


Error in assignment of Blanks 


The less serious of the two computer processing errors affected 
the wages and salaries data from the sample. It did not affect the wages 


and salaries data gathered on the 100% document (as published, for in- 
stance, in. the 1961 volume 3.3). 


The problem arose when the wages and salaries in the sample was 
blank. The computer was instructed to look first at any wages reported 
for the person on the other document, the one filled in by face-to-face 
enumeration. In many cases this yielded a satisfactory answer. However, 
frequently it did not. The original intention was that at this point a 
method of random assignments using criteria such as age, sex, education 
etc. would be used for these persons. In fact a figure of $3,300 was 
inserted regardless of sex, age or any other characteristic. 


This error affected the records of about 48,000 persons (weight- 


ed up from the sample), or about 1% of wage-earners. The national aggregate 


of wages and salaries was not altered appreciably since $3,300 was very 
close to the average reported wage. For males, however, this figure was 
about $400 below the reported average, and for females $1,300 above the 
reported average. 


Bs 


Evidence exists that non-responders to the wages question tend 
to have slightly lower wages than responders.(1) Therefore, the figure 
of $3,300 may be fairly appropriate for males. For females however, it 
represents a gross overstatement. In bulletin 4.1-2, tables Bl and B4, 
for instance, the average wages for females.is published as $2,051. 

Even ignoring downward bias due to non-response, the use of $3,300 for an 
assignment of blanks means that the published averages earnings figure is 
$13 too high. 


A particular subgroup for which large distortion resulted was _ 
the category "occupation not stated". This group included about 34,000 
of the 48,000 wage-earners affected by the error. As inspection of line 
Sonia bWes h4= LOwand hinem2ys, .table: BS, bulletinv4.1=2, 1961 Census 
shows, a huge concentration in the $3,000-3,499 groun resulted. Average 
incomes for this group were distorted in the same manner as for all wage- 
earners described above, although the effect was much greater. The "true" 
average for females, for instance, is at least $500 below the $2,649 pub- 
lished. 


Error in Computation of Averages 


“A much more serious mistake occurred with respect to the wages 
data gathered from all wage-earners by face-to-face enumeration. Size 
of earnings was reported in income intervals on the questionnaire, and 
for caiculation of averages each interval was given a weight. Disas- 
torously, the weight assigned to the $15,000+ class was $15,000. 


The effect on the average wage-earnings was negligible for 
females and lowered the male average $77. The effect of this error was 
negligible on all the lower-income occupations (including nearly all the 
females) where the numbers of persons reporting $15,000+ was insignificant. 
However, for certain occupations the distortion is staggering. Table I 
shows recalculated averages for the 41 worst-hit occupations, those whose 
distortion exceeded $200. 


Bluntly put, this error rendered the average earnings data as 
published, for instance, in volume 3.3 of the 1961 Census, quite uscless 
for inter-occupational comparisons of the higher-earnings occupations for 
males. 


Anyone wishing to make such comparisons, or to compare 1961 and 
1971 wage-earnings data, has two choices. He can use 1961 sample data on 
employment income as published in 1961 Volume 4.1 and the SX series of 
publications, or he can adjust for himself the 1961 tabies which he de- 
Sve Se COmUSEe. 
(1) Gower D. An Evaluation of 1961 Census Wage Data from Canvasser 
Enumeration and Self-Enumeration, Working Paper (Demographic and 
Socio-Economic Series) No. 3, Census Division, DBS. Ottawa, March 1968. 
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The individual user will, in general, lack information on the ° 
weight proper for the $15,000 class, and in most cases he will also lack 
information on the frequency of this class. The open-end interval published 
(and, in most runs, tabulated) is $10,000+. 


A user trying to adjust occupational data would probably find 
Table 1 useful. However, this would not be helpful for adjusting average 
earnings for other distributions. 


ingangattempuste provide the user with a general formula for 
adjusting tables where $10,000+ is the largest class,,the relationship 
between the 10,000+ frequency, the 15,000+ frequency, the 15,000+weight 
and the distortion among the different occupation groups was examined. It 
was found that fairly reliable estimates of the absolute distortion could 
be obtained using only one statistic: the percentage of wage-earners 
(working for wages and salaries) reporting more than $10,000. It was dis- 
covered that this figure gave a good estimate of both the percentage in 
the $15,000+ class and the weight of the class. A formula was derived by 
simple observation which seems to give decent results:* 


TUS 
@ ¥, 


a Sa ae Wea 


elo 10-7100 Leet >-10 


Where = 

a = "true! or adjusted average 

a = original average as published 

T = % of wage earners working for wages and salaries reporting more than $10,000. 


Two warnings should be observed in using this formula. First, 
it can be expected to work satisfactorily only on income distributions of 
a fairly "typical" shape. That is, unimodal, skewed, with mode less than 
median, median less than mean and a long upper-income tail stretching to 
several times the distance of the median point from the vertical axis. In 
addition, a smooth profile with no sudden changes in slope is needed. 


An example of a group for whom this formula will not work is 
airline pilots, whose income distribution has a mode of $16,000-$17,000, with 
a sharp drop thereafter and very few persons above the mode. 


*Due to data limitations, proper measurements of the variation from "true" 
values cannot be taken, except for the occupation figures. A better guide 
is the "shape of curve" criterian described above. If this is met, the 
formula can certainly be taken as an indicator of magnitude. The author 
apologizes that time and resource limitations prevented the calculation of 
more rigorous estimation parameters. 
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Second, care should be taken not to apply this formula to 1961 
wages data gathered from the population sample, for which this adjustment 
is not needed because incomes were given in digits. This exception there- 
fore applies to all data in the 1961 volume 4.1 and the "SX" series of 
publications. The formula is only applicable to the 1961 data on wage- 
earnings of wage-earners, as pubiished in volume 3.3. 


The $15,000+ averaging error affected not only the average 
earnings of certain occupation groups, but also of certain age and educa- 
tion groups. In table 2 the formula described above has been applied to. 
illustrate the adjustments required to certain published figures distributed 
by these variables. 


This error affected more than the relative figures. It was also 
responsible for a loss of $154 million from the aggregate wages and salaries 
gathered for wage-earners in 1961. Most of this loss was borne by males. 


4 4 
t ¢ ; 
, rm 7 
; j ' hnwh) aed oe 
One 24 éunevns 
‘ fa eadell 


7 
ait we 


TABLE 1. Average Wage-earnings of Male Wage-earners working 40 — 52 weeks, 
go+ Hours, Before and After Adjustment for $15,000+ Averaging Error, 
Selected Occupations,* Canada 1961 
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Unadjusted Adjusted No. persons 
average average affected 


Occupation 


1. Owners & Managers, N.E.S. in Primary 


PACA COO TACT Chad Gs MOR tcl ee Wy We ers Alums Va be whe 95303 10,996 Poe 
2. Owners & Managers, N.E.S. in Paper and 
Sree MCUSU LES, Mme. fas So Gere beta aehs O27 105,930 Za OS 
3. Owners and Managers, N.E.S. in Chem. & 
MRE oe PVCU CCS” Set pP SF MR alco c's Esl s Ma eheteee basta 9,069 LOSS 4 2 Oat 
4. Owners and Managers, N.E.S. in mining, 
Guarino and: O11 we lle str.svtntelel ss 9,407 10,844 Zo 
>. Owners and Managers, N.E.S. in misc. 
MSIGGe DIMUDSELLSS 56. rots tate tate hetnratenets eneia rishi O7a00 1,760 
6. Owners and Managers, N.E.S. in tobacco 
Pee U SMA TICKVS EY 1 GS * Bic ices tenetetecatareratews tans 9,168 HAO) 5 'sy45) 179 
Memoitdees ald MASTPEETALES aise later ams aretelerens 10,817 Note ys 588 
8. Owners and Managers, N.E.S. in Services 
COspus we Manavement: ita nie 4 tthe). 8462) 10,098 2,040 
9. Owners and Managers,.N.E.S. in Textile 
le UE Ble ete ES Rew Sak.) 8 Sane oe oe Ge 25 675 1,249 
10. Owners and Managers, N.E.S. in Leather 
ee SOME hese Fone a¥ caste its atdes <Tusdocetetencd ober: F500 8 oli 588 
Perot srcians “andusurgeons .%.. 0% Fats. geese 8,464 SIATTAS 
i2. Owners and Managers, N.E.S. in Printing 
Bee eS Hegre Od Bu.g.8 oF Mies, seth Patches ais Oo, 037 8,967 ig 1G 
13. Owners and Managers, N.E.S. in Knitting c 
eae age Apo RS ae, Ae ee S077 S0L6 343 
14. Owners and Managers, N.E.S. in Machinery 
ace: Suit megets 3 Tok al HGR in Baas are ae St Pon Do 9,412 1,396 
15. Owners and Managers, N.E.S. in Forestry 
br OS PL ie ec Sake Ge ons ae gt shee ens vis ss 7305 3,097 852 
16. Owners and Managers, N.E.S. in Elec~ 
oie Hee EOGUCiominGus tries Saas sss ms 8,623 9,380 555 
17. Owners and Managers, N.E.S. in Trans- 
portation Equipment Industries ....... Bie 24. SA7G 2 hOO 
18. Owners and Managers, N.E.S. in Food and 
Beveragem Ndusiy ies Gis, we. oh al ee 7 vee) POO are aga2l 
Pe Lowyers AitNaLarEeS: £26. awk. ok iccketels esomete alt ews 8,629 PRIS 
20. Owners and Managers, N.E.S. in Rubber 
NRVENSP NCTE 5 Ser Bae ed CRW ty 5a ge an aah ee oe 8,048 8,695 ae 
21. Owners and Managers, N.E.S. in Finance 
ancmlirsuvance $95 710k 2 Ioe.. D2 a fare She 7,580 8,226 20 , 363 
22. Owners and Managers, N.E.S. in Clothing 
Ligeti Lee eae: ake oreestsbsiat> ate ade at oke RIS 8,334 ee at: 
23. Owners and Managers, NvEwos an iMetal 
ET TOpE yah eb bel a veo0 )R stay (oe ae 0: \ oe) Sane gee er a 87,267. §,895 Seek 


(1) In this list are the 41 occupations for which the distortion exceeded $200. 
They are ranked in decending order of absolute size of distortion. 
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Unadjusted Adjusted No. persons 


Occupation 


average average affected  _ 

24. Owners and Managers, N.E.S. in Petroleum 

Perea OS TCS ARES OMEE 5) ope act Gre. erore tO aiece Se , O3203 95349 799 
25. Owners and Managers, N.E.S. in Non- 

Meta ica Minmexalue imate tact elot mretistshssl sce 7,/69 fais SL ih 33Sy4" 
26. Owners and Managers, N.E.S. in Whole- 

SeULGr UREN, hae eh rater ee Re) CA Cet ae H SOMES 75308) 19,029 
27. Owners and Managers, N.E.S.-in ; 

maga UCR LONE = Green tec a chev le-clale GIS #0), 6 40 8's Te 7,426 7,801 73895 
28. Owners and Managers, N.E.S. in Wood 

NTIS Ie CMe em Lot rye er em ate oieie ol ie acres Pehle Wg SQL 1 OZ 
Poewocetirrcy Salesman and Brokers ........... 65225 6,614 3,694 
SiO). JEST BREEN RS 26m cee ed komen cies Cee Gea ae SE ELS 8,474 1,916 
Boece uie ing Managers” . 66a ces ee ste wee tats 75016 8,151 2,025 
RE, SRE SURED qa Bis ie Gees ou ae ee Poe atl 8,268 8,602 306 
33. Owners and Managers, N.E.S. in Transport- 

ation, Commercial and other Utilities... 7,139 75460 15,468 
Cee icnienamec amc Musi]: Ceaeherns ......1.-sc. Dyas 7/ Sig Like es OS 
femmeveeerart Filots, Navigators ete. ss... 6k. Bry has 8,404 ios 
PCA MAACO TS) A lke 2 Sena nds oe ee See F518 7,840 2119 
37. Owners and Managers, N.E.S. in Furniture 

Aen iel URS (sree is meme ce car IN. Ree e's SoA be “clone gre Ey Oe 7 ea62 767 
38. Owners and Managers, N.E.S_ in Retail . 

UNTORSR CHE 15 RT Gareth cote gO tas Er a ee ge a Ds D7 “= Stelle Bi 2380) 
PU eAcrOms mm timMeeGiuainers andeShowmen 2.2... & LTS 4,680 483 
pemprem@ed ENSInCers Gass 2 det da ewe cs calees 8 ,083 8.236 ZOD 
41. Owners and Managers, N.E.S. in Health ° 

aynval Welheeraes Series: “oiavo cin oie coe Gio eee 6,460 6,663 2,071 


Sources: Unpublished data available at the Economic Characteristics Section, Census 
Division, DBS(2). 


(2) To obtain the frequency of the $15,000+ class, a special tabulation on 
wage-earners reporting 49-52 weeks, 35+ hours was used. The weight for 
the $15,000+ class was obtained from a table of employment income by 
occupation,for the labour force, tabulated from the population sample. 
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In addition, there were 17 occupations where the difference between the 
ac justed and unadjusted was less than $200 but over $100, and an additional 
14 between $50 and $100. 


Only one female occupation has a distortion of over $50, and this is female 
physicians and surgeons, which is in the $100-$200 distortion class. 
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TABLE 2. Average Earnings of Wage-Earners from Wages and Salaries, 
1961 Census, Unadjusted and Adjusted for 15,000+ Averaging Error 
(Estimated by Formula) 
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Schooling Age Unadjusted(1) Adjusted 
Me CACY). cvs see's s oss = 0s nial si elvis Total 2,964 2,964 
PCOS aks cs wicle setae ws cic ss en ome 89s Total S59. 35914 
45-54 4,778 4,794 
Pert G CY is sry cleleiccels w tees 6 6 se so es Total 3099 55819 
2534 Sirs 16 lle 5,424 
35-44 Jee 7330.0 
45-54 TR Soe” 7,017 
55-64 PeOst 7,452 


(1) As published in Bulletin 3.3-5, Table 17-1, 1961 Census of Canada. 


